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Abstract. Spectroastrometry is a technique which has the potential to resolve flux distributions on scales of 
milliarcseconds. In this study, we examine the application of spectroastrometry to binary point sources which are 
spatially unresolved due to the observational point spread function convolution. The technique uses measurements 
with sub-pixel accuracy of the position centroid of high signal-to-noise long-slit spectrum observations. With the 
objects in the binary contributing fractionally more or less at different wavelengths (particularly across spectral 
lines), the variation of the position centroid with wavelength provides some information on the spatial distribution 
of the flux. We examine the width of the flux distribution in the spatial direction, and present its relation 
to the ratio of the fluxes of the two components of the binary. Measurement of three observables (total flux, 
position centroid and flux distribution width) at each wavelength allows a unique separation of the total flux into 
its component parts even though the angular separation of the binary is smaller than the observations' point- 
spread function. This is because we have three relevant observables for three unknowns (the two fluxes, and the 
angular separation of the binary), which therefore generates a closed problem. This is a wholly different technique 
than conventional deconvolution methods, which produce information on angular sizes of the sampling scale. 
Spectroastrometry can produce information on smaller scales than conventional deconvolution, and is successful 
in separating fluxes in a binary object with a separation of less than one pixel. We present an analysis of the errors 
involved in making binary object spectroastrometric measurements and the separation method, and highlight 
necessary observing methodology. 

Key words, methods: data analysis - techniques: high angular resolution, spectroscopic - instrumentation: spec- 
trographs - stars: binary: general 



1. Introduction 

A telescope is limited by its resolution: traditionally, the 
smallest resolvable feature will typically be the size of the 
point-spread function (PSF). There are several contribu- 
tions to the PSF including the Earth's atmosphere (via 
seeing) for ground based telescopes, and the telescope it- 
self (via its optics). Conventionally, if the physical size of 
the flux distribution is smaller than the PSF, then all the 
information on that scale is blended and hence cannot be 
retrieved. For the special case of a binary object, there 
have been several techniques which have been proposed 
to extract the individual binary star spectra from the 
composite spectrum, involving single value decomposition 
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(Simon & Sturm 1994), Fourier methods (e.g. Hadrava 
1995), or Doppler tomography (Bagnuolo et ai, 1992). 

The technique of spectroastrometry allows the ob- 
server to gain some information on the distribution of 
flux on a spatial scale smaller than the PSF. It was 
originally discussed in the 1980s by Beckers, (1982), and 
Christy et ai, (1983) with more recent studies by Bailey 
(1998a, 1998b), Garcia et al, (1999), Takami et al, (2001), 
Takami, Bailey & Chrysostomou, (2003). Given that the 
technique has the potential to provide information about 
the flux distribution on milliarcsecond scales, it is surpris- 
ing that it has not been more widely exploited. 

Conceptually, spectroastrometry is easy to grasp: it 
involves taking a long slit spectrum, and relies on the ob- 
server being able to determine the centroid of a flux distri- 
bution to a fraction of a pixel. The exact spatial centre of 
a flux distribution may vary with wavelength if the com- 
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ponents making up that distribution contribute differing 
fractions of the total flux at a given wavelength. For ex- 
ample, a pair of objects in a binary system contribute dif- 
ferent amounts of flux at different wavelengths. Where one 
object has an emission line, the position centroid of the 
flux will move toward that object across that line before 
returning to the continuum position. For an absorption 
line, the opposite occurs. If the centroid position can be 
located accurately enough, then the relative contribution 
of the two objects may be calculated. Any astronomical 
source with a flux distribution which is asymmetric with 
respect to wavelength may be subject to the technique, 
such as binary stars (e.g. Bailey 1998a, 1998b), or out- 
flows and discs (e.g. Takami et al., 2003). 

More familiar techniques of deconvolution of long slit 
spectra have been presented by Courbin et al., (2000). 
Courbin et al. produce final results which are sampled at 
the Nyquist frequency of the observation (determined by 
the CCD chip pixels, see the discussion in §2 of Magain et 
al., 1998). Therefore, these techniques provide a method 
of producing a higher resolution than the observations, 
although they cannot provide any information on scales 
smaller than the Nyquist scale. 

Lucy & Walsh (2003, and references therein) produce 
an iterative technique, which is applied to the extraction 
of stellar spectra from a (crowded) convolved image. This 
is able to separate fluxes from objects with overlapping 
PSF by nominating a priori the objects which are point 
sources. This procedure performs well in crowded flelds, 
but is not sensitive to objects so close that their separation 
is smaller than the PSF. 

Here, we concentrate on a method of the separation 
of a "discrete" flux distribution - a binary object - into 
its components. Our aim is to produce a method which is 
able to extract the two fluxes from a binary object which 
is separated by less than a pixel, (an achievement impossi- 
ble with previously published methods) whilst remaining 
competitive for larger separations when compared to more 
traditional deconvolution techniques. 

The current observational technique of spectroastrom- 
etry (photocentre displacement) is supplemented by an in- 
vestigation into the width of the flux distribution and this 
provides the raison d'etre of this study: the addition of 
the information provided by the flux distribution's width 
closes the set of observables (total flux, photocentre posi- 
tion, and width of flux distribution) and unknowns (the 
two spectra and their separation) and thereby enables an 
unique separation of the composite spectrum into its com- 
ponent parts. We leave the problem of continuous flux 
distributions (e.g. a disc) until a later study (spectroas- 
trometry of a continuous flux distribution will yield the 
sub-pixel scale kinematics of the source). 

In §2 the binary object flux distribution its statistical 
properties are examined, before illustrating the flux sep- 
aration technique in §3. To provide a practical guide for 
this technique, a discussion of errors is provided in §4 and 
optimal observing strategies are given in §5. Discussion 
and Conclusions are given in §6 and §7 respectively. 



2. Observational statistics of the flux distribution 
of a convolved binary object 

The positional distribution of flux from any source is la- 
belled as J^x{x), where A is wavelength and x is the posi- 
tion along the slit. If the source is now assumed to be a 
binary object with a separation of many times the radius 
of either object, then the positional distribution of the flux 
is approximately 

^x{x) = fl,xSx,xi + f2,\Sx,X2, (1) 

where fi x and /2.A are the fluxes of the two objects lo- 
cated at positions xi and X2 respectively and da,b is the 
Kronecka delta function. 

We imagine that the flux is observed with a spectrom- 
eter and detected with a CCD detector with dimensions 
n X m pixels. The flux is sampled at discrete positions 
and wavelength: the pixel number in the spatial direction 
is labelled with index i, with i = corresponding to the 
bottom of the CCD frame and i = n corresponding to 
the top of the frame. Likewise the pixels in the; disper- 
sion (wavelength) direction are labelled with index j with 

< i < m. The flux in each CCD pixel is labelled Fij. 

During an observation, the flux distribution J^\{x) is 
flrst convolved with a function taking into account the 
seeing profile (for groimd based telescopes) , and telescope 
optics. We label this intrinsic "seeing" fimction S\{x). The 
flux is then binned into pixels as it is detected by the CCD. 
The measured distribution Fij is then 

F,^j = discrctize [J^\{x) (g) Sx{x)] . (2) 

We label the intrinsic width of the seeing function as as, 
which in the case of a Gaussian function is equivalent to 
the standard deviation. If the separation between the ob- 
jects \xi — X2\ is less than as, then the two objects are 
formally not resolved. Conventionally, separation of the 
individual spectra cannot be achieved. 

Observations have two main sources of noise (i) pho- 
ton counting errors, characterised by Poisson statistics of 
the number of photons in each pixel, and (ii) read noise 
generated in the process of charge transfer in the CCD 
during the read out stage. The relative contribution of 
these two error sources is dependent on the exposure time 
and brightness of the source observed, as well as the spec- 
trometer itself. First we consider the case of zero noise to 
develop the fundamental principles and technique, and we 
return to the effects of noise later. 

The three simple observables which may be measured 
from the flux distribution on the CCD are the total flux, 
the mean position of the flux, and the width of the flux 
distribution. In order to make the discussion as general as 
possible, henceforth we work with units of pixels. 

2.1. Total flux 

At a given wavelength A, corresponding to pixels {i = 

1 • • - n, j) the flux is distributed in position according to 
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eq.l and sampled onto pixel row i of the CCD. The "ex- 
tracted" total flux is the sum over all pixels in the spatial 
direction of the flux in each pixel: X]"=o = ftotj = 
fij + /2,i In traditional spectrographic work, this total 
extracted flux /tot.j (the spectrum) is the sole quantity 
gleaned from the observation. 

2.2. Position centroid 

The position of a distribution is often measured by the 
mean of the distribution - the 1^* moment. However, it 
is often the case that the mean is not particularly robust 
for non-Gaussian distributions (especially when noise is 
present, see e.g. Beers et al., 1990), and so a more so- 
phisticated approach is warranted. Good choices are the 
M-estimators such as Tukey's biweight, or Andrew's Sine 
(or wave) estimators (Goodall 1983, Press et al., 1986). 
These have a weighting function which penalises outliers 
from the distribution, which in this context translates to 
estimating the width over a specific pixel region. This will 
then avoid problems with extended wings of seeing profiles 
(and also cosmic ray hits, and noise-generated errors). 

We denote the position centroid of the distribution 
(however it is measured) as /i,: note that it can be mea- 
sured to fractions of a pixel hence Hj is a real number and 
not an integer. How does the centroid vary as the ratio 
of the flux of the two objects varies? This is the aspect 
of spectroastrometry which has received the most atten- 
tion (Bailey, 1998a, 1998b, Garcia et al, 1999, Takami 
et al, 2001; 2003). For equal fluxes fij = f2,j the cen- 
troid will lie exactly in the middle of flux distribution (at 

{X2-Xi)/2). 

The position centroid /ij is a direct probe of the rel- 
ative contribution of the two fluxes to the total flux. A 
simple relation can be written to calculate Hj in terms of 
the two fluxes: 

JtotJ JtotJ 

(e.g. Bailey, 1998a) assuming that both point sources are 
in the spectrometer slit. 

A particularly useful spectral feature which is used 
in the deconvolution technique below is an emis- 
sion/absorption line. These push the c;entroid toward the 
brighter object over a small wavelength range - clear in 
the centroid spectra of Mira (fig. 2 of Bailey 1998a) and 
HK Ori (fig. 9 of Baines et ai, 2004). Any slowly varying 
changes in the centroid can be subtracted out producing a 
centroid spectrum ^.j which is calibrated to be zero away 
from the line centre (in the continuum) and may be writ- 
ten as: 

d ftotj ftot ,cont 

rj + 1 Tcont + 1 

(see eq.3 of Takami et al., 2003), where /2,cont and /tot, cont 
are the continuum fluxes of object 2 and the total flux, 
and d is the separation of the objects in pixels. The flnal 
equality in eq.4 is expressed in terms of the ratio of the 



fluxes rj = f2,j/ fi,j and the flux ratio in the continuum 
r-cont = /2,cont//i,cont which is uscd later in §3. 

2.3. Width of the flux distribution 

The width of a distribution may be measured using several 
methods: the simplest characteristic width of the flux dis- 
tribution is the mean absolute deviation, and the most 
commonly used is the standard deviation. However, as 
with the position above, a more robust technique is re- 
quired in general. 

We denote the measured width of the distribution as 
(Tj . This will approach the limiting value as of the seeing 
fmiction Sx{x) as cither (i) the separation of the two ob- 
jects tends to zero, or (ii) the ratio of the less bright object 
flux to the brighter object flux tends to zero. If either of 
the objects becomes brighter/dimmer, then the observed 
width will change. The width aj will also change with the 
separation of the objects. If the separation of the objects is 
larger than the intrinsic as of the seeing, then the two ob- 
jects will start to be resolved, and the spectroastrometric 
technique is not required to separate the objects' spectra. 
Hence the width aj is a function of (i) the separation of 
the two objects, (ii) the intrinsic distribution's as, and 
(iii) the flux ratio of the two objects. 

A slight broadening of the convolved profile, i.e. aj > 

as, is present even in the contimmm, as both stars still 
contribute to the flux distribution. This is important in 
that the measured aj of the continuum cannot be used as 
an estimate for as- A separate measurement for as will be 
necessary (by observing a single star), particularly as the 
focus of the camera can change along the CCD producing 
a varying as in the dispersion direction. It may be possible 
to use the minimum of aj as an estimate for as where there 
is a particularly strong emission line present from one of 
the objects such that fij ^ f2.j, although this will still 
be a slight overestimate for as (see Appendix A. 2 for an 
evaluation of the errors associated with this procedure). 

In order to understand the behaviour of the changes 
in width of the flux distribution aj, with flux ratio, bi- 
nary object separation and intrinsic width as, we have 
performed an extensive series of numerical simulations. 
The flux ratio ranged from f2.j/.fi.j = 0.0 1.0 (stepsize 
0.05); the separation ranged from d = 3a s (stepsize 
O.las), and we have varied the intrinsic width as from 
2-10 pixels (similar to typical observations). Each reali- 
sation first convolves the binary object flux distribution 
with the function S\{x), and then maps the resultant flux 
onto pixels. We have also performed the calculation for dif- 
ferent functions S\{x), including a Gaussian, a Gaussian 
with boosted wings (the Gaussian func;tion multiplied by 
1 + 0.05{x/as)^), top-hat and triangular functions. In all 
cases we have measured the width of the resultant dis- 
tribution using Andrew's sine estimator, although identi- 
cal results are obtained with a bi-weight estimator, or a 
standard-deviation width. 
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Fig. 1. Top panel: variation of the width of the cUstri- 
bution with flux ratio. The symbols are the calculated 
data for an intrinsic convolving function of a Gaussian 
(open circles, width 5.0 pixels), a Gaussian with boosted 
wings (filled circles, width 6.2 pixels, see text) a top- 
hat (squares, width 3.0 pixel), and a triangle (triangles, 
width 4.6 pixels). The separation between the sources is 
3.0 pixels. All results have been divided by (d/as)^ {b from 
eq.5 listed in Table 1) to show the dependency of the width 
on the flux ratio only. Bottom panel: variation of the width 
with separation d between the sources for a flux ratio of 
f2,j/fi,j = 0.5. Symbols are the same as for the top panel. 
The solid lines in both panels are the fit of eq.5. 



Sx{x) 


a 


b 


c 


Gaussian 


0.116 


1.930 


3.468 


Gaussian with boosted wings 


0.118 


1.951 


3.468 


Top hat 


0.110 


1.858 


3.466 


Triangular 


0.130 


2.012 


3.448 



Table 1. Best fitting parameters to eq.5, S\{x) is the 
convolving function - see text. 



The results of some of these calculations are displayed 
in Fig.l. We find that the behaviour of the fractional 
change of the width of the distribution is almost indepen- 
dent of the function S\{x) and may be approximated by 
a fitting function. For objects with separations less than 
twice the intrinsic as (i-e- d < 2as) an excellent fit to the 
calculations is 



oj-as 



= a 



1- 



1 — min 



f2J 



.(5) 



The best fitting constants a, b, and c (with a search step- 
size of 0.001) in this expression are found to be slightly 
dependent on the exact shape of the convolving function 
S\{x). The best fitting values are listed in Table. 1. 

This expression is shown in Fig.l for different func- 
tions Sx{x). Eq.5 reproduces the numerical results within 



a few per cent of the calculated value for all of the convo- 
lution functions Sx{x) we have used (although it is slightly 
worse for the triangular function than the other functions). 
The error associated with use of this fit produces a sys- 
tematic shift in the separated fluxes (especially for the 
dimmer secondary object) in the practical application of 
the technique (see §3). To reduce this error, we can actu- 
ally measure the function Sx{x), again via a single object 
observation, and then use it to calculate either more ac- 
curate values of a, b, and c in Eq.5, or a "lookup" table 
of values for use in flux separation. Eq.5 is important in 
that this representation of the change in the width con- 
tains all of the convolution information of the binary flux 
distribution. 



3. Method of extraction of the individual spectra 

The total flux /totj is dependent on the fluxes of the two 
point sources fij and /2,j. The position displacement fx'j 
is dependent on fij, f2j and the separation d. Finally, 
the width of the flux distribution aj is dependent on fij, 
f2j and d. Therefore measurement of (i) the total flux 
/tot,j) (fi) the position displacement fXj and (iii) the width 
of the flux distributed in the spatial direction aj provides 
three observables for three unknowns (i.e. the fluxes of the 
two objects fij, f2,j, and their separation d). The set of 
three relations may be inverted to uniquely separate the 
flux distribution into its components. 

Previous attempts at spectroastrometric flux separa- 
tion have used the total flux /tot.j and position displace- 
ment fij as their only two observables. To be success- 
ful, prior knowledge of the objects must be obtained (e.g. 
Bailey 1998a knew the separation d of the binary). If this 
is not possible, then we have three unknowns (the two 
fluxes and source separation) and only the two observ- 
ables. Hence flux separation may not be uniquely achieved 
as we do not have a closed set: attempts to separate the 
flux using only two observables may produce misleading 
results. 

We use the following method to deconvolve the spec- 
tra: first for a given value of the separation d, we calculate 
the fiux ratio in the continuum Tcont with eq.5 and the ob- 
served width cTcont- Then for each row in the dispersion di- 
rection j we invert eq.4 using the observed centroid fx'j and 
continuum ratio rcont to produce the fiux ratio rj. Then 
with /totj = /ij +/2,j, we calculate the individual spectra 
/i.j and /2j. Using these values, we predict the width of 
the distribution using eq.5, and compare to the observed 
values using a simple calculation to evaluate the fit. 
Finally we repeat this procedure with differing values of 
the separation d until a best fit is found. 

We follow this procedure for two examples (see below). 
This suggested method applies in the case that the width 
as of the convolving function is known. ^ The separation d 



^ We note that this procedure may be changed if the prior 
information is varied: if the separation is accurately known 
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Fig. 2. Example A: the top two panels are the input fluxes of the individual objects (the primary object's flux 
is normalised to unity in the continuum). The next three panels are the observable quantities: the total flux, the 
relative change in width of the distribution, and the change in position ccntroid. The next two panels arc the best-fit 
extractions of the two fluxes, and the bottom two panels are the ratio of the extracted flux to the input fluxes from 
the top two panels. The separation of the binary is d= 2.0 pixels, and the convolving function is Gaussian with width 
as = 4.0 pixels. 



is the independent parameter which is varied to minimise 
for the fit. 



3.1. Example A 

In order to fully test the method, a series of emission and 
absorption "lines" of differing contrasts and widths are 



then a similar method could be devised which had the width 
as as the independent parameter. 



imposed on the continuum of both objects (see the top 
two panels in Fig. 2). These provide parts of the spectrum 
which have all combinations of the primary and secondary 
object brightening and dimming over the lines; from the 
left (in increasing pixels) the features correspond to (i) 
object 1 emission line only, (ii) object 2 absorption only, 
(iii) large emission line in object 2 such that it is brighter 
than object 1, (iv) both objects absorption, (v) object 1 
emission, object 2 absorption, (vi) both objects emission 
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log(SNR) 

Fig. 3. Variation of mean fractional deviation per pixel 
of the extracted fluxes from the input fluxes (eq.6) 
(top panel), and best-fit separation (bottom panel) for 
Example A with increasing signal-to-noise ratio (SNR). 
The solid line in the top panel is the mean fractional devi- 
ation per pixel expected for this example, with systematic 
error of 1% see §4. 

(vii) object 1 absorption, object 2 emission, (viii) both ob- 
jects absorption, but with the absorption minima offset, 
(ix) both objects emission with identical contrast and fi- 
nally (x) both objects with absorption lines of identical 
contrast. Note that the two fluxes are not meant to repre- 
sent any sort of object or binary system in particular: they 
are simply meant to illustrate the differing combinations 
of primary and secondary flux. The continuum flux ratio 
is constant at all wavelengths at rcont = 0.2. 

An "observation" is then made: this consists of con- 
volving the input with S\{x) assumed to be a Gaussian 
with a width (identical to the standard deviation in this 
case) of as = 4.0 pixels. The resultant distribution is then 
binned into pixels in position and dispersion. The observ- 
ables are then calculated - the total flux, the width, and 
position offset. To derive the relative change in width, one 
of the input fluxes is set to zero, and an "observation" 
taken of a single flux component, and the width calcu- 
lated. The derived observables for a source separation of 
2.0 pixels arc displayed in the third-fifth panels of Fig. 2 
for a range of signal-to-noise ratios (SNR). 

One feature in the width distribution is particularly 
worthy of note. When the fainter object in the continuum 
has a large emission line - feature (iii) in fig. 2 ~ the width 
of the flux distribution aj can be seen to decrease in the 
centre of the line, producing a double-peaked profile. The 
distribution width aj has a max:imum for a flux ratio of 
unity. How does the width aj change from the line wing of 
feature (iii) to line centre? In the wing, object 2 is much 
less bright than object 1, and the flux ratio in eq.5 is less 
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Fig. 4. Map of mean fractional deviation per pixel (left) 
and error in the best-fit separation (right) for a set of simu- 
lations using the input spectra of Example A, but varying 
the flux ratios and the object separation. The contours 
are logarithmically spaced with steps of 0.2dex. The bold 
contours correspond to an error of 0.1 (i.e. 10%). In the 
left panel, the highest contour corresponds to an 4.0% 
error and is situated in the top left corner. In the right 
panel the highest contour corresponds to 2.6% and is in 
the middle-top right of the panel. 

than unity. As the line emission increases, the flux ratio 
increases (and hence so does the width aj) until the emis- 
sion from both objects is equal and the flux ratio is unity. 
Here, the width of the distribution aj reaches its max- 
imum value. As the line emission increases further such 
that object 2 is brighter than object 1, the width aj de- 
creases because cij is dependent on the minimum of the 
ratios rj and l/rj (this is because there must be no differ- 
ence in aj whichever of the two objects is denoted object 
!)• 

This produces a maximum in the width aj which is 
offset in wavelength from the maximum in emission, and 
hence a double peaked profile in aj is observed. The peaks 
in aj then correspond to a flux ratio of unity. 

The bottom four panels of Fig. 2 show the results of 
the separation method. Panels six and seven display the 
extracted fluxes, and panels eight and nine show the ex- 
tracted fluxes divided by the input fluxes (if the extrac- 
tion is perfect, then these panels should be unity). It is 
clear from examination of these four panels that the flux 
separation method achieves good results even for SNR of 
100, and that as SNR increases, the accuracy of the flux 
separation becomes more accurate. To find the best-flt, 
the distance between the components d was varied from 
0-5 pixels with a stepsizc of 0.001 pixels. This procedure 
yielded d ^ 1.956, 1.989, 1.979 and 1.986 pixels for SNR 
of 100, 220, 450, and 1000 respectively. 
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To assess how weU the flux separation was achieved 
we calculate a mean fractional deviation per pixel of the 
extracted flux: 



j=m 

.f.d. = -y 



i=o 



- 1 



+ 



/2,i 



f2J 



- 1 



(6) 



where the subscript "ex" and "in" refer to the extracted 
and input fluxes respectively. The m.f.d. per pixel for 
Example A along with the best-fit separation d as a func- 
tion of SNR are shown in Fig. 3. The separation asymp- 
totes to the input separation of 2.0 pixels, and the m.f.d. 
per pixel decreases with increasing SNR. However, even 
for SNR> 1000, the separation is not exact, and indeed, 
is not expected - this is an empirical method, and hence 
will contain errors (sec §4 below). The solid line on the 
top panel of Fig. 3 is the expected m.f.d. per pixel for this 
simulation as described below in §4. 

Figs. 2 & 3 illustrate the performance of the method as 
the SNR varies for a constant continuum flux ratio. We 
also investigate the method for differing continuum flux 
ratios rcont and distance between the objects d. For this 
we have conducted a series of simulations for the same fea- 
tures in the spectra (top two panels in Fig. 2). However, the 
continuum flux ratio rcont ~ /2.cont/./i,cont is varied from 
unity to 1/50, and the separation d of the sources ranges 
from 0.1 pixels to 4.0 pixels (i.e. 0.025-1.0 of the standard 
deviation of the Gaussian convolving function). For all 
of these simulations, the noise level is fixed to produce 
SNR= 220 (similar to that in the simulations of Courbin 
et ai, 2000 for comparison of the two methods). 

For each pair of rcont and d we calculate the m.f.d. per 
pixel of the extracted fluxes and plot this as a contour map 
in the left panel of Fig. 4. The contours are spaced logarith- 
mically in intervals of 0.2, and the bold contour marks the 
m.f.d. per pixel of 0.1 (log(m.f.d.) = -1). We can clearly 
see that the technique performs best for large separations 
d and for flux ratios rcont close to unity. We also note that 
accurate extraction of the fluxes can be achieved for object 
separations of less than one pixel (a feature of the tech- 
nique which, to our knowledge, is unique). We have also 
calculated the fractional error in the best-fit separation 
and show the contour map of this in the right hand panel 
of Fig. 4. Again the contours are spaced in 0.2dex, and the 
bold line corresponds to a fractional error of 0.1 (i.e. 10%). 
The main feature of this map is that the best-fit separa- 
tion d is not very sensitive to the continuum flux ratio, 
and that accuracy of better than 10% should be possible 
for input separations of larger than around one pixel. 

3.2. Example B 

To provide a direct comparison with previous methods, we 
adopt exactly the same problem considered by Courbin et 
al., (2000) for a second test. This consists of a quasar and 
a star with a featureless continuum (see Courbin et ai, 
2000, §3.1). The quasar spectrum used is the mean of all 



objects taken from the 2QZ project^ (Croom et al, 2004). 
The two point sources are separated by 2.0 pixels, and the 
convolving Gaussian to provide the seeing has a width of 
4.0 pixels. The relative continuum brightness ranges from 
^■cont = /2,cont//i,cont =0.2-1 from the blue to the red end 
of the spectra. The mean flux ratio of the two components 
is larger in this case than in Example A, and therefore we 
should expect more accurate results. 

Fig. 5 shows the results for this test in a similar fash- 
ion as Fig. 2, and Fig. 6 shows how the m.f.d. per pixel in 
the extracted fluxes and the best-fit separation vary with 
SNR. As can be seen, the extraction does indeed produce 
more accurate results than for Example A. For the cases 
of SNR = 200-300 (exactly the same as in Courbin et al.) 
the m.f.d. per pixel is « 0.05. We estimate that this is 
larger than the typical m.f.d. in the results of Courbin et 
al. (from the inserts in their Fig. 3) by a factor of ^ 2. This 
indicates that the spectroastrometric method of extracting 
fluxes is not quite as efficient as Courbin et al.'s method 
for this particular example - unfortunately Courbin et al. 
do not discuss differing SNR simulations. 

As in Example A, we examine the technique for dif- 
fering values of continuum ratios and component separa- 
tions (the continuum ratio is defined by the fluxes at pixel 
j — 1). Fig. 7 displays the contour map of m.f.d. per pixel 
and best-fit separation (as Fig. 4 for Example A), for varia- 
tion of the continuum ratio (unity - 1/50) and component 
separation {d = 0.1-4.0 pixels). We have fixed the SNR 
at 220. Again, the most accurate results are obtained for 
larger component separations d, and for larger flux ratios. 

4. Random and systematic errors 

We can calculate the expected fractional deviation per 
pixel in a straightforward way. At each pixel j, small 
variations in the fluxes fij and f2,j {6fij and Sf2j 
respectively) produce a variation in the flux ratio Srj, 
where Srj/i-j = Sf2j/f2,j — Sfij/fij. As the sum of 
the two fluxes is a constant for any j, then the small 
variations 6fi,j and 6/2. j are equal and opposite in sign 
{S.fi,j = -Sf2.j)- Hence, Srj/rj = -(1 + l/rj)Sfij/fij. 

The mean fractional deviation per pixel (as defined in 
eq.6) is the sum of the absolute fractional errors: 



m.f.d. = MA^+l^^^'^l 



1 + 



1 



(7) 



In the limit that the flux ratio rj <C 1, the expected 
error in rj is calculated in Appendix A.l to be 



1 



ac y2SNR V d ) 



+ br\ 



syst 



(8) 



where a, 6, and c are the constants in eq.5, and (5r|gyg^ is 
the systematic error from the fitting formula eq.5 

We can use this estimate to calculate the expected 
m.f.d. error for Examples A & B. Inserting a systematic 
error of (5r|syst = 0.01 (i.e. 1%), the resultant predictions 

^ http://www.2dfquasar.org/ 
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Fig. 5. Example B: the top two panels are the input fluxes of the individual objects. The next three panels are 
the observable quantities: the total flux, the relative change in width of the distribution, and the change in position 
ccntroid. The next two panels arc the best-fit extractions of the two fluxes, and the bottom two panels arc the ratio 
of the extracted flux to the input fluxes from the top two panels. The separation of the binary is d = 2.0 pixels, and 
the convolving function is Gaussian with width as = 4.0 pixels. 



for the m.f.d. per pixel arc plotted as the solid line on 
the top panels of Figs. 3 & 6. We note that, in deriving 
the expressions above, we have assumed that rj <C 1, and 
so our calculations arc not directly applicable to cither 
Examples A & B, which have mean flux ratios per pixel of 
0.18 and 0.36. Therefore we expect a closer flt to the nu- 
merical results for Example A than for Example B. which 
is seen in Figs. 3 & 6. The predicted m.f.d. per pixel are 
good estimates for the calculated values. 



5. Observing practicalities 

The method is clearly able to separate binary object fluxes 
if the SNR is high enough. However, there are several prac- 
tical aspects of the technique which need to be noted. 
Whilst some of these are discussed in detail in Bailey 
(1998a), and also in Baines (2004) it is useful to provide 
a list including aspects gleaned in the development of the 
method. 

1. Both components of the binary must be fully in the 
spectrograph slit else unknown fractions of the flux from 
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Fig. 6. Variation of mean fractional deviation per pixel 
of the extracted fluxes from the input fluxes (eq.6) 
(top panel), and best-fit separation (bottom panel) for 
Example B with increasing signal-to- noise ratio (SNR). 
The solid line in the top panel is the m.f.d. per pixel ex- 
pected for this example, with systematic error of 1% see 
§4. 



m.f.d. flux error separation error 




0.5 1 1.5 0.5 1 



logio(fi,c.ont / fa, com) 
Fig. 7. Map of mean fractional deviation (left) and error 
in the best-fit separation (right) for a set of simulations 
using the input spectra of Example B, but varying the flux 
ratios and the object separation. The contours are loga- 
rithmically spaced with steps of 0.2dex. The bold contours 
correspond to an error of 0.1 (i.e. 10%). In the left panel, 
the highest contour corresponds to an 2.6% error and is 
situated in the top left corner. In the right panel the high- 
est contour corresponds to 2.6% and is in the middle-top 
left of the panel. 



one component will be observed, and so the method will 
fail. Use of a wide slit will degrade the spectral resolution 
of the final observation a little, although this is a small 
sacrifice as it is of paramount importance that both ob- 
jects are fully within the slit. If it is not known a priori 
whether the object is a binary or not, then a wide slit 
should be used. 

2. Full sampling of the convolved flux distribution is 
essential - as already stated at least 4 pixels (or equivalent 
for dithering patterns) must sample this function to avoid 
Nyquist sampling problems. 

3. For some spectrographs, it is easily possible to re- 
peat the observation with the instrument rotated through 
180°. This is useful as it allows any instrumental misalign- 
ment problems to be subtracted - this point is particularly 
stressed in Bailey (1998a) and is worth repeating here. 

4. The flux separation method will only operate in the 
spatial direction parallel to the slit axis. If the line-of- 
centres axis is orientated normal to the slit axis, then no 
flux separation will be possible. At intermediate angles be- 
tween these two axes then the best-fit separation will be 
equal to the actual separation multiplied by the cosine of 
the angle between the axes. Information on the position 
angle of the distribution may be obtained by combining 
measurements of two observations taken with the spec- 
trometer slit at two different angles on the sky (e.g. see 
Baines et al, 2004). 

5. An important aspect of the method is to observe 
a single object with a similar position on the sky as the 
intended target in order to measure the intrinsic width of 
the convolution function. These single object observations 
can also be used to calculate series of convolutions of bi- 
nary flux distributions, in order to generate a "look-up" 
table of values to be used in the deconvolution, instead 
of using eq.5. In practice, this may be difflcult: the seeing 
can change with time, making single star observations less 
useful. When one object has a particularly strong emission 
line, then the measured width of the flux distribution at 
line centre aj may \k used as an estimator for the intrinsic 
width Us (see Appendix A. 2). 

6. Discussion 

Given the volume of published work dedicated to the de- 
convolution of images, why are we investigating new tech- 
niques to achieve the same result? As the image is sam- 
pled in discrete intervals (the pixels), then any deconvolu- 
tion method can only increase its resolution (a result from 
sampling theory). The fundamental aspect of the spec- 
troastrometric technique is that it utilises changes in the 
flux distribution convolved with the PSF. As this convo- 
lution takes place before the distribution is sampled onto 
the pixels, then information on scales smaller than the 
pixels can be retrieved (this is a different approach to the 
problem than deconvolution methods, such as those de- 
scribed in Courbin et al., 2000). This is because the infor- 
mation about the binary is spread throughout the PSF. 
Therefore, if the PSF itself is well sampled then extraction 
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of point sources separated by less than the traditional spa- 
tial Nyquist sampling interval (pixel) is possible. Hence 
extraction of fluxes is possible at a much higher spatial 
resolution than with previous methods. Of course, there 
is a price to pay for this advantage over deconvolution 
techniques. Wc have assiimcd a priori two point sources, 
and so have imposed a criterion on the flux distribution 
(a feature not present in traditional deconvolution meth- 
ods) . Also this is essentially an empirical method - method 
noise will always be present in the solution. 

Spectroastrometry is likely to achieve best results for 
the cases when specific known binary sources are tar- 
geted. Serendipitous binary discovery may prove frustrat- 
ing when it is clear from the data that a binary component 
is present, although little else may be derived because, for 
example, the secondary component is not fully in the slit, 
or that the SNR is not high enough to extract the indi- 
vidual fluxes reliably. The technique is sufficiently general 
that any binary object may be examined (although triple 
systems may cause problems!). 

How powerful is the technique? For ground based tele- 
scopes, the dominant contributor to the point spread func- 
tion is the seeing, which typically ranges from one-few 
arcsec. With an equivalent signal to noise ratio of several 
hundred (often achieved with modern instruments) 

and pixel sizes of a few tenths of an arcsec (typi- 
cal of spectrographs), the position centroid may be de- 
termined to ~milliarcsecond accuracy. Spectroastrometry 
should then be a natural complementary technique to the 
growing optical/IR interferometry field which operates at 
a similar angular scale. 

Future investigations into spectroastrometry will con- 
centrate on continuous flux distributions, such as discs, 
where the main goal will be to measure the kinematics 
of the emitting gas on sub-pixel scales. This technique 
is difficult, in that high SNRs are necessary (and hence 
the observational aspects may be challenging), but does 
promise great rewards. 

7. Conclusion 

We describe the technique of spectroastrometry and 
present a method to separate individual binary object 
fluxes from the point-spread function which is present in 
all observations. The method makes use of the total flux, 
the position offsets and the characteristic width of the 
flux distribution in the spatial direction: the three obser- 
vational quantities it is possible to measure directly from 
a long slit spectrograph image. The observing technique 
to achieve this is not difficult for careful observers. 

We have demonstrated that the performance of the 
technique is similar to previously published deconvolution 
methods which reconstructs the original flux distribution 
on the sampling scale ('^pixels). Although spectroastrom- 
etry cannot achieve this complete mapping, it is the only 
technique in our knowledge, which is able to successfully 
separate the fluxes of binary objects with sub-pixel sepa- 
rations. 



The technique of spectroastrometry is a natural coun- 
terpart to interferometry, as it provides spectral informa- 
tion (and hence kinematics) at similar resolutions, which 
with available instrumentation is typically milliarcsec- 
onds. 
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Appendix A: Random and Systematic Errors 

If the width ctj of the spatial distribution of the flux is 
sampled over at least two pixels (so the profile will ex- 
tend over at least 4 pixels), then the extent of the profile 
does not effect the error estimates (essentially this is an 
expression of Nyquist sampling). 

The two error sources will be from photon counting 
statistics and read-noise. Where the read-noise errors are 
dominant in determining the pixel signal-to-noise ratio 
(SNR), it can be justified that as long as the errors are 
symmetrically distributed around zero then their net effect 
on the /Uj and aj should be zero (especially when robust 
location and width estimators are used). 
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A.l. Error in the flux ratio 

When photon noise dominates, the error in the mim- 

1 /2 

ber of photons Fij in each pixel is simply 6Fij = F.^J . 
The signal-to-noise ratio of the position-integrated flux is 

SNR, = {j:iFij)/{J:iFijy^' = {lliFijY'^- The Stan- 
dard errors in the position centroid and width (mean and 
standard deviation) are the familiar expressions dfi'^ = 

aj/ny^ and Sdj = (Tj/{2njY/'^, where Uj is the number 
of counts in the position-integrated profile (e.g. Topping, 
1972). Assuming that the SNR is the same across the spec- 
trum (i.e. SNR=SNRj, identical for all j) we find 



SNR 



and 5a n = —=r^ — . 

\/2SNR 



(A.1) 



c 

o 

'i 2 

u 

0) -1 



The more the seeing function deviates from a Gaussian, 
then these expressions for the errors 5(Tj and Sji'j become 
less accurate (e.g. see Beers et al. 1990). 

For a given error in the position centroid and width 
we can estimate the typical error in using the technique. 
Let us assume that the random error in the intrinsic width 
of the seeing profile is small in comparison with 5a j, i.e. 
5a s 5a j (see A. 2 for the consequences of incorrectly 
deriving as). Furthermore, let us assume that the random 
error in the separation 5d is small, and so the dominant 
error source in the flux ratio is via the variation in the 
width aj (our method of solution uses the separation d as 
the independent variable to minimise the x^j and hence 
5d is dominated by systematic errors). 

Rearrangement of eq.5 to make the flux ratio rj = 
f2,j/fi,j the subject allows a simple error analysis to be 
performed. This produces the random error in the flux 
ratio 5rj of: 
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Fig. A.l. Top panel: logarithm of the m.f.d. per pixel ver- 
sus the systematic error in the intrinsic width of the seeing 
as- Bottom panel: the variation of the best fit separation. 
See text. 

over time. An estimate of as could be made from the bi- 
nary objects' observations by assuming that the smallest 

measured value of aj occurs when one of the objects com- 
pletely dominates the output: this is correct in the limit 



h,j/fi,j 



0. 



as then aj 



as- If this approximation is 



5r 



3 I rand 



ac 



5aj_ 
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(A.2) 



where we have also assumed that rj <C 1, i.e. the case for d as 



used, then as will in general be slightly overestimated. 
What are the consequences from this assumption? 

Both the best-fit separation between the components, 
and the individual fluxes (notably the continuum flux ra- 
tio) will have an associated error. From examination of 
eq.5 we see that the fractional change in the separation 
caused solely by an change in as (5a s) is 

5d 5as 1 



0-s 



1 - ^ 



(A.4) 



small flux ratios. There will also be a systematic error in 

this ratio frjsyst related to the inaccuracy in the fitting 
formula eq.5, leading to the total error in the flux ratio as 



= ^'^jlra„d+ '^^jlsyst 
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syst 
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(A.3) 



where, for the final expression, we have inserted eq.A.l. 

A.2. Systematic errors in the intrinsic as and its time 
variation 

We have assumed that the measurement error in the in- 
trinsic width of the flux distribution as is zero. In practice 
this may not be the case as there is no high SNR obser- 
vation of a single object or where the seeing is changing 



which assumes that there is no resultant error in the flux 
ratio (strictly this is never the case, but we proceed in 
the spirit of producing relatively simple error estimates). 
Note that an overestimate of as leads to a systematic 
underestimate of d. 

We have calculated a series of simulations for Example 
A in the text with high SNR of 3000, and systematically 
changed the value of as used in the; flux separation. We 
take the actual value of as and multiply by a fraction 
ranging from 0.95-1.05 to mimic both an underestimate 
and an overestimate of as from the data. The results are 
shown as the fllled circles in Fig.Al: the top panel is the 
logarithm of the m.f.d. per pixel, and the bottom panel is 
the best-fit separation between the components. It is clear 
that « 1 (close to unity) produces the best results by 
far. This underlines the desirability for accurate measure- 
ment of as- When (f) > 1.03, the numerical method fails, 
as all the values of [a — as)/as < 0.0. This produces a 
discontinuity in the trends in Fig.A.l. 
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Plotted as a solid line on the bottom panel is our es- 
timate (above, with b = 1.93) for the variation in d. For 
each value of (/) wc calculate a mean value of (a — erg) /a 
per pixel for eq.A.4 (this produces the main difference be- 
tween our estimated values from eq.A.4. and the numerical 
values) . The calculated value predicts a zero crossing in d 
at <p = 1.017, as there the mean value of {a — as)/<Js is 
zero. 



